# f / 

O/ 

(IV GENERAL INFORMATION : 

(i) APPLICANT: Schlesainger , Joseph 
Sap, Jan M. 

(ii) TITLE OP INVENTION: NOVEL RECEPTOR-TYPE PHOSPHOTYROSINE 
PHOSPHATASE— ALPHA 

(ill) NUMBER OF SEQUENCES : 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PENNIE & EDMONDS 

(B) STREETS 1155 AVENUE OF THE AMERICAS 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: U.S. A* 

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/015,985 

(B) FILING DATE: 10-FEB-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A. . 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE/DOCKET NUMBER: 7683-020 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-9741/8864 

(C) TELEX: 66141 PENNIE 



-71- 

SEQUENCE LISTING 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 802 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Asp Ser Trp Phe lie Leu Val Leu Leu Gly Ser Gly Leu lie Cys 
* 5 10 ~ 15 

Val Ser Ala Asn Asn Ala Thr Thr Val Ala Pro Ser Val Gly lie Thr 
20 25 30 

Arg Leu lie Ash Ser Ser Thr Ala Glu Pro Val Lys Glu Glu Ala Lys 
35 40 45 

Thr Ser Asn Pro Thr Ser Ser Leu Thr Ser Leu Ser Val Ala Pro Thr 
50 55 60 
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Phe Ser Pro Asn lie Thr Leu Gly Pro Thr Tyr Leu Thr Thr Val Asn 
65 70 75 80 

Ser Ser Asp Ser Asp Asn Gly Thr Thr Arg Thr Ala Ser Thr Asn Serx 

85 90 95 

lie Gly lie Thr lie Ser Pro Asn Gly Thr Trp Leu Pro Asp Asn Gin 
100 105 110 

Phe Thr Asp Ala Arg Thr Glu Pro Trp Glu Gly Asn Ser Ser Thr Ala 
115 . 120 125 

Ala Thr Thr Pro Glu Thr Phe Pro Pro Ser Gly Asn Ser Asp Ser Lys 
130 135 140 

Asp Arg Arg Asp Glu Thr Pro lie lie Ala Val Met Val Ala Leu Ser 
145 ISO 155 160 

Ser Leu Leu Val He Val Phe He He lie Val Leu Tyr Met: Leu Arg 
165 170 175 

Phe Lys Lys Tyr Lys Gin Ala Gly Ser His Ser Asn Ser Phe Arg Leu 
180 185 190 

Ser Asn Gly Arg Thr Glu Asp Val Glu Pro Gin Ser Val Pro Leu Leu 
195 200 205 

Ala Arg Ser Pro Ser Thr Asn Arg Lys Tyr Pro Pro Leu Pro Val Asp 
210 215 220 

Lys Leu Glu Glu Glu lie Asn Arg Arg Met Ala Asp Asp Asn Lys Leu 
225 230 235 240 

Phe Arg Glu Glu Phe Asn Ala Leu Pro Ala Cys Pro lie Gin Ala Thr 
245 250 255 

Cys Glu Ala Ala Ser Lys Glu Glu Asn Lys Glu Lys Asn Arg Tyr Val- 
260 265 270 

Asn lie Leu Pro Tyr Asp His Ser Arg Val His Leu Thr Pro Val Glu 
275 280 285 

Gly Val Pro Asp Ser Asp Tyr lie Asn Ala Ser Phe He Asn Gly Tyr 
290 295 300 

Gin Glu Lys Asn Lys Phe lie Ala Ala Gin Gly Pro Lys Glu Glu Thr 
305 310 315 320 

Val Asn Asp Phe Trp Arg Met He Trp Glu Gin Asn Thr Ala Thr He 
325 330 335 

Val Met Val Thr Asn Leu Lys Glu Arg Lys Glu Cys Lys Cys Ala Gin 
340 345 350 

Tyr Trp Pro Asp Gin Gly Cys Trp Thr Tyr Gly Asn He Arg Val Ser 
355 360 365 

Val Glu Asp Val Thr Val Leu Val Asp Tyr Thr Val Arg Lys Phe Cys 
370 375 380 

He Gin Gin Val Gly Asp Met Thr Asn Arg Lys Pro Gin Arg Leu He 
385 390 395 400 

Thr Gin Phe His Phe Thr Ser Trp Pro Asp Phe Gly Val Pro Phe Thr 
405 410 415 

Pro He Gly Met Leu Lys Phe Leu Lys Lys Val Lys Ala Cys Asn Pro 
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420 425 430 

Gin Tyr Ala Gly Ala lie Val Val His Cyo Ser Ala Gly Val Gly Arg 
435 440 445 

Thr Gly Thr Phe Val Val lie Asp Ala Met Leu Asp Met Met His Thr 
450 455 460 

Glu Arg Lys Val Asp Val Tyr Gly Phe Val Ser Arg He Arg Ala Gin 
465 470 475 " 480 

m 

Arg Cys Gin Met Val Gin Thr Asp Met Gin Tyr Val Phe lie Tyr Gin 
485 490 495 

Ala Leu Leu Glu His Tyr Leu Tyr Gly Asp Thr Glu Leu Glu Val Thr 
500 505 510 

Ser Leu Glu Thr His Leu Gin Lys lie Tyr Asn Lys lie Pro Gly Thr 
515 520 525 

Ser Asn Asn Gly Leu Glu Glu Glu Phe Lys Lys Leu Thr Ser He Lys 
530 535 540 

He Gin Asn Asp Lys Met Arg Thr Gly Asn Leu Pro Ala Asn Met Lys 
545 550 555 560 

Lys Asn Arg Val Leu Gin He He Pro Tyr Glu Phe Asn Arg Val He 
565 570 575 

He Pro Val Lys Arg Gly Glu Glu Asn Thr Asp Tyr Val Asn Ala Ser 
580 585 590 

Phe He Asp Gly Tyr Arg Gin Lys Asp Ser Tyr He Ala Ser Gin Gly 
595 600 605 

Pro Leu Leu His Thr lie Glu Asp Phe Trp Arg Met He Trp Glu Trp 
610 615 620 

Lys Ser Cys Ser He Val Met Leu Thr Glu Leu Glu Glu Arg Gly Gin 
625 630 635 640 

Glu Lys Cys Ala Gin Tyr Trp Pro Ser Asp Gly Leu Val Ser Tyr Gly 
645 650 655 

Asp He Thr Val Glu Leu Lys Lys Glu Glu Glu Cys Glu Ser Tyr Thr 
660 665 670 

Val Arg Asp Leu Leu Val Thr Asn Thr Arg Glu Asn Lys Ser Arg Gin 
675 680 685 

He Arg Gin Phe His Phe His Gly Trp Pro Glu Val Gly He Pro Ser 
690 695 700 

Asp Gly Lys Gly Met lie Ser He He Ala Ala Val Gin Lys Gin Gin 
705 710 715 720 

Gin Gin Ser Gly Asn His Pro He Thr Val His Cys Ser Ala Gly Ala 
725 730 735 

Gly Arg Thr Gly Thr Phe Cys Ala Leu Ser Thr Val Leu Glu Arg Val 
740 745 750 

Lys Ala Glu Gly He Leu Asp Val Phe Gin Thr Val Lys Ser Leu Arg 
755 760 765 

Leu Gin Arg Pro His Met Val Gin Thr Leu Glu Gin Tyr Glu Phe Cys 
770 775 780 
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Tyr Lys Val Val Gin Glu Tyr lie Asp Ala Phe Ser Asp Tyr Ala Asn 
785 790 795 * 800 

Phe Lys 

(2) INFORMATION FOR SEQ ID NO: 2s 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 2409 base, pairs 

(B) TYPE* nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

ATGGATTCCT GGTTCATTCT TGTTCTGCTC GGCAGTGGTC TGATATGTGT CAGTGCCAAC 60 

AATGCTACCA CAGTTGCACC TTCTGTAGGA ATTACAAGAT TAATTAACTC ATCAACGGCA 120 

GAACCAGTTA AAGAAGAGGC CAAAACTTCA AATCCAACTT CTTCACTAAC TTCTCTTTCT 180 

GTGGCACCAA CATTCAGCCC^AAATATAACT CTGGGACCCA CCTATTTAAC CACTGTCAAT 240 

TCTTCAGACT CTGACAATGG GACCACAAGA ACAGCAAGCA CCAATTCTAT AGGCATTACA 300 

ATTTCACCAA ATGGAACGTG G CTTCCAG AT AACCAGTTCA CGGATGCCAG AACAGAACCC 360 

TGGGAGGGGA ATTCCAGCAC CGCAGCAACC ACTCCAGAAA CTTTCCCTCC TTCAGGTAAT 420 

TCTGACTCGA AGGACAGAAG AGATGAGACA CCAATTATTG CGGTGATGGT GGCCCTGTCC 480 

TCTCTGCTAG TGATCGTGTT TATTATCATA GTTTTGTACA TGTTAAGGTT TAAGAAATAC 540 

AAGCAAGCTG GGAGCCATTC CAATTCTTTC CGCTTATCCA ACGGCCGCAC TGAGGATGTG 600 

GAGCCCCAGA GTGTGCCACT TCTGGCCAGA TCCCCAAGCA CCAACAGGAA ATACCCACCC 660 



CTGCCCGTGG 


ACAAGCTGGA AGAGGAAATT 


AACCGGAGAA 


TGGCAGACGA 


CAATAAGCTC 


720 


TTCAGGGAGG 


AATTCAACGC TCTCCCTGCA 


TGTCCTATCC 


AGGCCACCTG 


TGAGGCTGCT 


780 


TCCAAGGAGG 


AAAACAAGGA 


AAAAAATCGA 


TATGTAAACA 


TCTTGCCTTA 


TGACCACTCT 


840 


AGAGTCCACC 


TGACACCGGT 


TGAAGGGGTT 


CCAGATTCTG 


ATTACATCAA 


TGCTTCATTC 


900 


ATCAACGGTT 


ACCAAGAAAA 


GAACAAATTC 


ATTGCTGCAC 


AAGGACCAAA 


AGAAGAAACG 


960 


GTGAATGATT 


TCTGG CGG AT 


GATCTGGGAA 


CAAAACACAG 


CCACCATCGT 


CATGGTTACC 


1020 


AACCTGAAGG 


AGAGAAAGGA 


GTGCAAGTGC 


GCCCAGTACT 


GGCCAGACCA 


AGGCTGCTGG 


1080 


ACCTATGGGA 


ATATTCGGGT 


GT CTGTAG AG 


GATGTGACTG 


TCCTGGTGG A 


CTACACAGTA 


1140 


CGGAAGTTCT 


GCATCCAGCA 


GGTGGGCGAC 


ATGACCAACA 


GAAAGCCACA 


GCGCCTCATC 


1200 


ACTCAGTTCC 


ACTTTACCAG 


CTGGCCAGAC 


TTTGGGGTGC 


CTTTTACCCC 


GATCGGCATG 


1260 


CTCAAGTTCC 


TCAAGAAGGT 


GAAGGCCTGT 


AACCCTCAGT 


ATGCAGGGGC 


CATCGTGGTC 


1320 


CACTG CAGTG 


CAGGTGTAGG 


GCGTACAGGT 


ACCTTTGTCG 


TCATTGATGC 


CATGCTGGAC 


1380 


ATGATGCATA 


CAGAACGGAA 


GGTGGACGTG 


TATGGCTTTG 


TGAGCCGGAT 


CCGGGCACAG 


1440 


CGCTGCCAGA 


TGGTGCAAAC 


CGATATGCAG 


TATGTCTTCA 


TATACCAAGC 


CCTTCTGGAG 


1500 



-75- 



CATTATCTCT 


ATGGAGATAC 


AGAACTGGAA 


GTGACCTCTC 


TAGAAACCGA 


C CTG GAG AAA 


1560 


ATTTACAACA 


AAATCCGAGG 


GACCAGCAAC 


AATGGATTAG 


AGGAGGAGTT 


TAAGAAGTTA 


1620 


ACATCAATCA 


AAATCCAGAA 


TGACAAGATG 


CGGACTGGAA 


ACCTTCCAGC 


CAACATGAAG 


1680 


AAGAACCGTG 


TTTTACAGAT 


CATTCCATAT 


GAATTCAACA 


GAGTGATCAT 


TCCAGTTAAG 


1740 


CGGGGCGAAG AGAATACAGA CTATGTGAAC GCATCCTTTA TTGATGGCTA CCGGCAGAAG 


180O 


GACTCCTATA 


TCGCCAGOCA 


GGGCCCTCTT 


CTCCACACAA 


TTGAGGACTT 


CTGGCGAATG 




ATCTGGGAG J. 


/"•/"• TPPHV 
wAAA 1 lur 




ATGCTAACAG 




GAGAGGCCAw 




GAGAAGTGTG 


CCCAGTACTG 


GCCATCTGAT 


GGACTGGTGT 


CCTATGGAGA 


TATTACAGTG 




GAACTGAAGA 


AGGAGGAGGA 


ATGTGAGAGC 


TACACCGTCC 


GAGACCTCCT 


GGTCACCAAC 




ACCAGGGAGA 


ATAAGAGCCG 


GCAGATCCGG 


CAGTTCCACT 


TCCATGGCTG 


GCCTGAAGTG 


9 1 on 


GGCATCCCCA 


GTGACGGAAA 


GGGCATGATC 


AGCATCATCG 


CCGCCGTGCA 


GAAGCAGCAG 




CAGCAGTCAG 


GGAACCACCC 


CATCACCGTG 


CACTGCAGCG 


CCGGGGCAGG 


AAGGACGGGG 


2220 


ACCTTCTGTG 


CCCTGAGCAC 


CGTCCTGGAG 


CGTGTGAAAG 


CAGAGGGGAT 


TTTGGATGTC 


2280 


TTCCAGACTG 


TCAAGAGCCT 


GCGGCTACAG 


AGGCCACACA 


TGGTCCAGAC 


ACTGGAACAG 


2340 


TATGAGTTCT 


GCTACAAGGT 


GGTGCAGGAG 


TATATTGATG 


CATTCTCAGA 


TTATGCCAAC 


2400 


TTCAAGTAA 












2409 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Asp Ser Trp Phe lie Leu Val Leu Phe Gly Ser Gly Leu lie His 
1 5 10 15 

Val Ser Ala Asn Asn Ala Thr Thr Val Ser Pro Ser Leu Gly Thr Thr 
20 25 30 

Arg Leu lie Lys Thr Ser Thr Thr Glu Leu Ala Lys Glu Glu Asn Lys 
35 40 45 

Thr Ser Asn Ser Thr Ser Ser Val lie Ser Leu Ser Val Ala Pro Thr 
50 55 60 

Phe Ser Pro Asn Leu Thr Leu Glu Pro Thr Tyr Val Thr Thr Val Asn 
65 70 75 80 

Ser Ser His Ser Asp Asn Gly Thr Arg Arg Ala Ala Ser Thr Glu Ser 
85 90 95 

Gly Gly Thr Thr lie Ser Pro Asn Gly Ser Trp Leu lie Glu Asn Gin 
100 105 110 

Phe Thr Asp Ala lie Thr Glu Pro Trp Glu Gly Asn Ser Ser Thr Ala 
115 120 125 
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Ala Thr Pro Glu Thr Phe Pro Pro Ala Asp Glu Thr Pro lie lie 

130 " 135 140 

Ala Val Met Val Ala Leu Ser Ser Leu Leu Val lie Val Phe lie lie 
145 ISO 155 160 

lie Val Leu Tyr Met Leu Arg Phe Lys Lys Tyr Lys Gin Ala Gly Ser 
lg S 170 ~ 175 

His Ser Asn Ser Phe Arg Leu Ser Asn Gly Arg Thr Glu Asp Val Glu 
180 . 185 190 

Pro Gin Ser Val Pro Leu Leu Ala Arg Ser Pro Ser Thr Asn Arg Lys 
195 200 205 

T Y r Pf« Pro Leu Pro Val Asp Lys Leu Glu Glu Glu lie Asn Arg Arg 
210 215 220 • . 

Met Ala Asp Asp Asn Lys Leu Phe Arg Glu Glu Phe Asn Ala Leu Pro 
225 230 235 240 

Ala Cys Pro He Gin Ala Thr Cys Glu Ala Ala Ser Lys Glu Glu Asn 
245 250 * 255 

Lys Glu Lys Asn Arg Tyr Val Asn He Leu Pro Tyr Asp His Ser Arg 
260 265 270 

Val His Leu Thr Pro Val Glu Gly Val Pro Asp Ser Asp Tyr lie Asn 
275 280 285 

Ala fff Phe Ile Asn Gly Gln Glu L y s Asn Lys Phe He Ala Ala 

290 295 300 

Gin Gly Pro Lys Glu Glu Thr Val Asn Asp Phe Trp Arg Met lie Trp 
305 310 315 3 20 

Glu Gin Asn Thr Ala Thr He Val Met Val Thr Asn Leu Lys Glu Arg • 

325 330 . 335 

Lys Glu Cys Lys Cys Ala Gin Tyr Trp Pro Asp Gin Gly Cys Trp Thr 
340 345 3so 

Tyr Gly Asn Val Arg Val Ser Val Glu Asp Val Thr Val Leu Val Asp 
355 360 365 

■ Iyr I£« Val LyS Phe Ser Ile Gln Gln Val Gly Asp Val Thr Asn 

370 375 380 

Arg Lys Pro Gln Arg Leu Ile Thr Gln Phe His Phe Thr Ser Trp Pro 
385 390 395 * 400 

Asp Phe Gly Val Pro Phe Thr Pro He Gly Met Leu Lys Phe Leu Lys 
405 410 415 

Lys Val Lys Ala Cys Asn Pro Gln Tyr Ala Gly Ala He Val Val His 
420 42S 430 

Cys Ser Ala Gly Val Gly Arg Thr Gly Thr Phe Val Val He Asp Ala 
435 440 445 

Met Leu Asp Met Met His Ser Glu Arg Lys Val Asp Val Tyr Gly Phe 
450 455 460 

Val Ser Arg He Arg Ala Gln Arg Cys Gln Met Val Gln Thr Asp Met 
465 470 475 480 

Gln Tyr Val Phe Ile Tyr Gln Ala Leu Leu Glu His Tyr Leu Tyr Gly 
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485 490 495 

Asp Thr Glu Leu Glu Val Thr Ser Leu Glu Thr His Leu Gin Lys He 
500 505 510 

Tyr Asn Lys He Pro Gly Thr Ser Asn Asn Gly Leu Glu Glu Glu Phe 
515 520 525 

Lys Lys Leu Thr Ser He Lys He Gin Asn Asp Lys Met Arg Thr Gly 
530 535 540 

Asn Leu Pro Ala Asn Met Lys Lys Asn Arg Val Leu Gin He He Pro 
545 550 555 560 

Tyr Glu Phe Asn Arg Val lie He Pro Val Lys Arg Gly Glu Glu Asn 
565 570 575 

Thr Asp Tyr Val Asn Ala Ser Phe lie Asp Gly Tyr Arg Gin Lys Asp 
580 585 590 

Ser Tyr He Ala Ser Gin Gly Pro Leu Leu His Thr He Glu Asp Phe 
595 600 605 

Trp Arg Met lie Trp Glu Trp Lys Ser Cys Ser lie Val Met Leu Thr 
610 615 620 

Glu Leu Glu Glu Arg Gly Gin Glu Lys Cys Ala Gin Tyr Trp Pro Ser 
625 630 635 640 

Asp Gly Leu Val Ser Tyr Gly Asp He Thr Val Glu Leu Lys Lys Glu 
645 650 655 

Glu Glu Cys Glu Ser Tyr Thr Val Arg Asp Leu Leu Val Thr Asn Thr 
660 665 670 

Arg Glu Asn Lys Ser Arg Gin lie Arg Gin Phe His Phe His Gly Trp 
675 680 685 

Pro Glu Val Gly He Pro Ser Asp Gly Lys Gly Met He Asn He lie 
690 695 700 

Ala Ala Val Gin Lys Gin Gin Gin Gin Ser Gly Asn His Pro He Thr 
705 .710 715 720 

Val His Cys Ser Ala Gly Ala Gly Arg Thr Gly Thr Phe Cys Ala Leu 
725 730 735 

Ser Thr Val Leu Glu Arg Val Lys Ala Glu Gly He Leu Asp Val Phe 
740 745 750 

Gin Thr Val Lys Ser Leu Arg Leu Gin Arg Pro His Met Val Gin Thr 
755 760 765 

Leu Glu Gin Tyr Glu Phe Cys Tyr Lys Val Val Gin Glu Tyr He Asp 
770 775 780 

Ala Phe Ser Asp Tyr Ala Asn Phe Lys 
785 790 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2872 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAATTCCGGC GAGTGAGGCG CTGACAGGGA CTCGCGGGGG CATCTTGCAC AGACCCCTGG 60 

ACCACGCCGC CATCGCAGCC TCCAGCCCAG TCCTCTCTCT GCCGCTTCTC CTCGCCATGG 120 

AGGCCGCCGA CCGCCGTCCG CGGGCTTCGA GCAGCGGACC GGGCCGGGCT GACCCCATGT 180 

GGGCCGAGAG CCCGGTCQTG AGGCGGAGCT GCCGTGCGCG TCCCCCGCGG TCCCGCCCCA 240 

GCGCCGGGCT CGGTCAGCAT GGATTCCTGG TTCATTCTTG TCCTGTTTGG CAGTGGTCTA 300 

ATACATGTTA GTGCCAACAA TGCTACTACA GTTTCACCTT CTTTAGGAAC GACAAGATTA 360 

ATTAAAACAT CAACAACAGA ATTGGCTAAG GAAGAGAATA AAACCTCAAA TTCAACCTCT 420 

TCAGTAATTT CTCTTTCTGT GGCACCAACA TTCAGCCCAA ACCTGACTCT GGAGCCCACC 480 

TATGTGACTA CTGTTAATTC TTCACACTCT GACAATGGGA CCAGGAGGGC AGCCAGCACG 540 

GAATCTGGAG GCACTACCAT TTCCCCGAAC GGAAGCTGGC TTATTGAGAA CCAGTTCACG 600 

GATGCCATAA CAGAACCCTG GGAGGGGAAC TCGAGCACTG CAGCAACCAC TCCAGAAACC 660 

TTCCCCCCGG CAGATGAGAC ACCAATTATT GCGGTGATGG TGGCCCTGTC CTCTCTGCTA 720 

GTAATCGTGT TTATTATCAT AGTTCTGTAC ATGTTAAGGT TTAAGAAATA CAAGCAAGCT 780 

GGGAGTCATT CCAACTCTTT CCGCCTGTCA AATGGCCGCA CGGAGGATGT GGAGCCCCAA 840 

AGTGTACCAC TTCTGGCCAG GTCCCCGAGC ACCAACAGGA AGTACCCACC ACTGCCTGTG 900 

GACAAGCTGG AAGAGGAGAT TAACCGGAGA ATGGCTGATG ACAATAAGCT CTTCAGAGAA 960 

GAATTCAACG CTCTCCCTGC TTGTCCTATC CAGGCCACCT GTGAGGCTGC CTCCAAGGAA 1020 

GAAAACAAGG AAAAAAACCG CTATGTAAAC ATCCTGCCCT ATGACCACTC TAGAGTGCAC 1080 

CTGACACCTG TTGAAGGGGT CCCAGATTCT GATTACATCA ACGCTTCATT CATTAATGGC 1140 

TACCAGG AAA AGAACAAATT CATCGCTGCA CAAGGACCAA AAGAAGAAAC AGTGAATGAC 1200 

TTCTGGAGAA TGATATGGGA ACAAAACACA GCTACTATTG TCATGGTGAC CAACCTGAAG 1260 

GAGAGAAAGG AGTGTAAATG TGCCCAATAC TGGCCAGACC AAGGCTGCTG GACCTATGGG 1320 

AATGTCCGTG TGTCTGTCGA GGATGTGACT GTTCTGGTGG ACTACACAGT ACGGAAATTC 1380 

TCGATCCAGC AGGTGGGCGA CGTGACCAAC AGGAAACCAC AGCGCCTCAT CACTCAGTTC 1440 

CACTTCACCA GCTGGCCAGA CTTTGGGGTG CCTTTCACCC C AATTGG CAT GCTCAAGTTC 1500 

CTCAAGAAGG TGAAGGCCTG TAACCCTCAG TACGCAGGGG CTATCGTGGT CCACTGCAGT 1560 

GCAGGTGTAG GGCGCACTGG CACCTTTGTT GTCATCGATG CCATGCTGGA CATGATGCAT 1620 

TCGGAG CGCA AAGTGGATGT ATATGGGTTT GTGAGCCGGA TCCGGGCCCA GCGCTGCCAG 1680 

ATGGTACAGA CAGACATGCA GTACGTCTTC ATATACCAGG CCCTTCTGGA GCATTATCTG 1740 

TATGGGGACA CAGAACTGGA AGTGACTTCT CTAGAAACCC ACCTACAAAA AATTTATAAC 1800 

AAGATCCCAG GGACTAGCAA CAACGGGTTA GAGGAGGAGT TTAAGAAATT AACTTCAATC 1860 

AAAATCCAGA ATGACAAGAT GCGCACGGGA AACCTTCCAG CCAACATGAA GAAGAACCGG 1920 
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GTTTTACAGA TCATTCCATA TGAATTTAAC AGAGTGATCA TTCCAGTCAA ACGAGGCGAA 1980 

GAGAACACAG ACTATGTGAA CGCATCCTTC ATTGATGGAT ACCGGCAGAA AGACTCCTAC 2040 

ATTGCCAGCC AGGGCCCTCT TCTCCACACG ATTGAGGACT TCTGGCGAAT GATCTGGGAG 2100 

TGGAAGTCCT GTTGTATCGT AATGCTGACA GAACTGGAAG AG AG AG G CCA GGAGAAGTGT 2160 

GCCCAGTACT GGCCATCTGA TGGCCTGGTG TCCTACGGAG ACATCACAGT TGAGCTGAAG 2220 

AAGGAGGAGG AATGTGAAAG CTACACTGTC CGAGACCTCC TGGTCACCAA CACGAGGGAG 2280 

AAGAAGAGTC GGCAAATCCG GCAGTTCCAC TTCCACGGCT GGCCTGAGGX GGGCATCCCC 2340 

AGCGACGGCA AGGGCATGAT CAACATCATT GCAGCAGTGC AGAAGCAGCA GCAGCAGTCG 2400 

r 

GGGAACCATC CCATCACTGT GCACTGCAGT GCCGGGGGAG GACGGACAGG AACCTTCTGT 2460 

GCCTTGAGCA CAGTCCTGGA ACGTGTGAAA GGAGAAGGAA TTTTAGATGT CTTCCAAACT 2520 

GTGAAGAGCC TGCGGCTGCA GAGGCCACAC ATGGTCCAGA CACTGGAACA GTATGAATTC 2580 

TGCTACAAGG TGGTACAGGA ATACATTGAC GCCTTTTCAG ATTATGCCAA CTTCAAGTGA 2640 

CAGGTGACAA GGCCCAGAGA CAGGAGAATT GCCTTTAATA TTTTG TAATA TTCTGTTTTT 2700 

GTTAATATAC CCAAAATTGT ATATATCTTA TAACTGTTTT AGAAATGGCA CATAGGCTTC 2760 

TATTACCTGT TAGATGGAGA TTTTGTATGT AAATGTGTTA GCACTGATAG TCCTT-TTCCA 2820 

GTGTTTTATT GGGAAATTAA TAGTGTGATA TTTGGGTTGA TATAATGAAT TC 2872 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDE0NESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asn Gin Asn Lys Asn Arg Tyr Val Asp lie Leu Pro Tyx Asp Tyr Asn 
1 5 10 ^ 15 

Arg Val Glu Leu Ser Glu lie Asn Gly Asp Ala Gly Ser Asn Tyr lie 
20 25 30 

Asn Ala Ser Tyr lie Asp Gly Phe Lys Glu Pro Arg Lys Tyr lie Ala 
35 40 45 

Ala Gin Gly Pro Arg Asp Glu Thr Val Asp Asp Phe Trp Arg Met He 
50 55 60 

Trp Glu Gin Lys Ala Thr Val He Val Met Val Thr Arg Cys Glu Glu 
65 70. 75 80 

Gly Asn Arg Asn Lys Cys Ala Glu Tyr Trp Pro Ser Met Glu Glu Gly 
85 90 95 

Thr Arg Ala Phe Gly Asp Val Val Val Lys He Asn Gin His Lys Arg 
100 105 110 

Cys Pro Asp Tyr He He Gin Lys Leu Asn He Val Asn Lys Lys Glu 
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ns 120 125 

Lys Ala Thr Gly Arg Glu Val Thr His He Gin Phe Thr Ser Trp Pro 
130 135 140 

Asp His Gly Val Pro Glu Asp Pro His Leu Leu Leu Lys Leu Arg Arg 
"5 150 155 160 

Arg Val Asn Ala Phe Ser Asn Phe Phe Ser Gly Pro He Val Val His 
165 170 175 

Cys Ser Ala Gly Val Gly Arg Thr Gly Thr Tyr He Gly He Asp Ala 
180 185 190 

Met Leu Glu Gly Leu Glu Ala Glu Asn Lys Val Asp Val Tyr Gly Tvr 
195 200 205 

Val Val Lys Leu Arg Arg Gin Arg Cys Leu Met Val Gin Val Glu Ala 
210 215 220 

Gin Tyr He Leu He His Gin Ala Leu Val Glu 
225 230 235 

INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(±i) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6; 

Asn Lys Glu Lys Asn Arg Tyr Val Asn lie Leu Pro Tyr Asp His Ser 
• 1 5 10 15 

Arg Val His Leu Thr Pro Val Glu Gly Val Pro Asp Ser Asp Tyr He 
20 25 30 

Asn Ala Ser Phe He Asn Gly Tyr Gin Glu Lys Asn Lys Phe lie Ala 
35 40 45 

Ala Gin Gly Pro Lys Glu Glu Thr Val Asn Asp Phe Trp Arg Met He 
50 55 60 

Trp Glu Gin Asn Thr Ala Thr lie Val Met Val Thr Asn Leu Lys Glu 
65 70 75 80 

Arg Lys Glu Cys Lys Cys Ala Gin Tyr Trp Pro Asp Gin Gly Glu Trp 
85 90 95 

Thr Tyr Gly Asn lie Arg Val Ser Val Glu Asp Val Thr Val Leu Val 
100 105 HO 

Asp Tyr Thr Val Arg Lys Phe Cys He Gin Gin Val Gly Asp Met Thr 
115 120 125 

Asn Arg Lys Pro Gin Arg Leu lie Thr Gin Phe His Phe Thr Ser Trp 
130 135 140 

Pro Asp Phe Gly Val Pro Phe Thr Pro He Gly Met Leu Lys Phe Leu 
145 150 155 160 

Lys Lys Val Lys Ala Cys Asn Pro Gin Tyr Ala Gly Ala He Val Val 
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165 170 175 

His Cys Ser Ala Gly Val Gly Arg Thr Gly Thr Phe Val Val lie Asp 
ISO 185 190 

Ala Met Leu Asp Met Met His Thr Glu Arg Lys Val Asp Val Tyr Gly 
195 200 205 

Phe Val Ser Arg He Arg Ala Gin Arg Cys Gin Met Val Gin Thr Asp 
210 215 220 

Met Gin Tyr Val Phe lie Tyr Gin Ala Leu Leu Glu 
225 230 ' 235 

[2) INFORMATION FOR SEQ ID NOs7: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asn Lye His Lys Asn Arg Tyr lie Asn lie Val Ala Tyr Asp His Ser 
1 5 10 15 

Arg Val Lys Leu Ala Gin Leu Ala Glu Lys Asp Gly Lys Leu Thr Asp 
20 25 30 

Tyr lie Asn Ala Asn Tyr Val Asp Gly Tyr Asn Arg Pro Lys Ala Tyr 
35 40 45 

lie Ala Ala Gin Gly Pro Leu Lys Ser Thr Ala Glu Asp Phe Trp Arg 
50 55 60 

Met He Trp Glu His Asn Val Glu Val He Val 'Met He Thr Asn Leu 
65 70 75 80 

Val Glu Lys Gly Arg Arg Lys Cys Asp Gin Tyr Trp Pro Ala Asp Gly 
85 90 95 

Ser Glu Glu Tyr Gly Asn Phe Leu Val Thr Gin Lys Ser Val Gin Val 
100 105 110 

Leu Ala Tyr Tyr Thr Val Arg Asn Phe Thr Leu Arg Asn Thr Lys He 
115 120 125 

Lys Lys Gly Ser Gin Lys Gly Arg Pro Ser Gly Arg Val Val Thr Gin 
130 135 140 

Tyr His Tyr Thr Gin Trp Pro Asp Met Gly Val Pro Glu Tyr Ser Leu 
145 150 155 * 160 

Pro Val Leu Thr Phe Val Arg Lys Ala Ala Tyr Ala Lys Arg His Ala 
165 170 175 

Val Gly Pro Val Val Val His Cys Ser Ala Gly Val Gly Arg Thr Gly 
180 185 190 

Thr Tyr He Val Leu Asp Ser Met Leu Gin Gin lie Gin His Glu Gly 
195 200 205 

Thr Val Asn He Phe Gly Phe Leu Lys His He Arg Ser Gin Arg Asn 
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210 215 220 

Tyr Leu Val Gin Thr Glu Glu Gin Tyr Val Phe He His Asp Thr Leu 
225 230 235 240 

Val Glu 

2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asn Lys His Lys Asn Arg Tyr lie Asn lie Leu Ala Tyr Asp His Ser 
1 5 10 15 

Arg Val Lys Leu Arg Pro Leu Pro Gly Lys Asp Ser Lys His Ser Asp 
20 25 30 

Tyr lie Asn Ala Asn Tyr Val Asp Gly Tyr Asn Lys Ala Lys Ala Tyr 
35 40 45 

lie Ala Thr Gin Gly Pro Leu Lys Ser Thr Phe Glu Asp Phe Trp Arg 
50 ,55 60 

Met lie Trp Glu Gin Asn Thr Gly lie lie Val Met lie Thr Asn Leu 
65 70 75 80 

Val Glu Lys Gly Arg Arg Lys Cys Asp Gin Tyr Trp Pro Thr Glu Asn 

.85 90 95- 

Ser Glu Glu Tyr Gly Asn lie 'lie Val Thr Leu Lys Ser Thr Lys lie 
100 105 110 

His Ala Cys Tyr Thr Val Arg Arg Phe Ser lie Arg Asn Thr Lys Val 
115 120 125 

Lys Lys Gly Gin Lys Gly Asn Pro Lys Gly Arg Gin Asn Glu Arg Val 
130 135 140 

Val lie Gin Tyr His Tyr Thr Gin Trp Pro Asp Met Gly Val Pro Glu 
145 150 155 160 

Tyr Ala Leu Pro Val Leu Thr Phe Val Arg Arg Ser Ser Ala Ala Arg 
165 170 175 

Met Pro Glu Thr Gly Pro Val Leu Val His Cys Ser Ala Gly Val Gly 
180 185 190 

Arg Thr Gly Thr Tyr lie Val lie Asp Ser Met Leu Gin Gin lie Lys 
195 200 205 

Asp Lys Ser Thr Val Asn Val Leu Gly Phe Leu Lys His He Arg Thr 
210 215 220 

Gin Arg Asn Tyr Leu Val Gin Thr Glu Glu Gin Tyr He Phe He His 
225 230 235 240 

Asp Ala Leu Leu Glu 
245 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: ' 

(A) NAME /KEY: Mod if ied-aites 

(B) LOCATION: 1. .248 

(D) OTHER INFORMATION: /label= Xaa 

/note= "For the Consensus Sequence, Xaa = Lack of 
Consensus" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

- Asn Lys His Lys Asn Arg Tyr Xaa Asn lie Leu Xaa Tyr Asp His Ser 
1 5 10 15 

Arg Val Lys Leu Xaa Xaa Leu Xaa Xaa Lys Xaa Xaa Lys Xaa Ser Asp 
20 25 30 

Tyr lie Asn Ala Xaa Tyr Xaa Asp Gly Tyr Asn Glu Pro Lys Xaa Tyr 
35 40 45 

lie Ala Ala Gin Gly Pro Leu Lys Xaa Thr Val Glu Asp Phe Trp Arg 
50 55 60 

Met lie Trp Glu Gin Asn Thr Xaa Val lie Val Met Xaa Thr Asn Leu 
65 70 75 80 

Val Glu Lys Gly Arg Arg Lys Cys Xaa Gin Tyr Trp Pro Xaa Xaa Gly 
85 90 95 . 

Ser Glu Xaa Tyr Gly Asn lie Xaa Val Thr Val Lys Xaa Val Xaa Val 
100 105 110 

Leu Ala Xaa Xaa Asp Tyr Thr Val Arg Lys Phe Xaa Xaa Arg Asn Thr 
115 120 125 

Lys lie Xaa Lys Xaa Gly Xaa Lys Xaa Xaa Xaa Lys Gly Arg Xaa Xaa 
130 135 140 

Gly Arg Val Val Thr Gin Tyr His Xaa Thr Xaa Trp Pro Asp Met Gly 
145 150 155 160 

Val Pro Glu Tyr Pro Leu Pro Val Leu Xaa Phe Val Arg Xaa Val Xaa 
165 170 175 

Ala Ala Xaa Xaa Xaa Xaa Xaa Gly Pro Xaa Val Val His Cys Ser Ala 
180 185 190 

Gly Val Gly Arg Thr Gly Thr Tyr lie Val lie Asp Xaa Met Leu Gin 
195 200 205 

Gin lie Xaa Xaa Glu Xaa Xaa Val Xaa Val Tyr Gly Phe Xaa Lys His 
210 215 220 

lie Arg Xaa Gin Arg Xaa Tyr Xaa Val Gin Thr Glu Glu Gin Tyr Xaa 
225 230 235 240 

Phe lie His Xaa Ala Leu Xaa Glu 
245 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asn Lys Ser Lys Asn Arg Asn Ser Asn Val lie Pro Tyr Asp Tyr Asn 
1 5 10 15 

Arg Val Pro Leu Lys His Glu Leu Glu Met Ser Lys Glu Ser Glu His 

.20 25 .30 

Asp Ser Asp Glu Ser Ser Asp Asp Asp Ser Asp Ser Glu Glu Pro Ser 
35 40 45 

Lys Tyr lie Asn Ala Ser Phe lie Met Ser Tyr Trp Lys Pro Glu Val 
50 55 60 

Met lie Ala Ala Gin Gly Pro Leu Lys Glu Thr lie Gly Asp Phe Trp 
65 70 75 80 

Gin Met lie Phe Gin Arg Lys Val Lys Val lie Val Met Leu Thr Glu 
.85 90 95 

Leu Lys His Gly Asp Gin Glu lie Cys Ala Gin Tyr Trp Gly Glu Gly 
100 105 110 

Lys Gin Thr Tyr Gly Asp lie Glu Val Asp Leu Lys Asp Thr Asp Lys 
115 120 125 

Ser Ser Thr Tyr Thr Leu Arg Val Phe Glu Leu Arg His Ser Lys Arg 
130 135 140 

Lys Asp Ser Arg Thr Val Tyr Gin Tyr Gin Tyr Thr Asn Trp Ser Val 
145 150 155 160 

Glu Gin Leu Pro Ala Glu Pro Lys Glu Leu lie Ser Met lie Gin Val 
165 170 175 

Val Lys Gin Lys Leu Pro Gin Lys Asn Ser Ser Glu Gly Asn Lys His 
180 185 190 

His Lys Ser Thr Pro Leu Leu lie His Cys Arg Asp Gly Ser Gin Gin 
195 200 205 

Thr Gly lie Phe Cys Ala Leu Leu Asn Leu Leu Glu Ser Ala Glu Thr 
210 215 220 

Glu Glu Val Val Asp lie Phe Gin Val Val Lys Ala Leu Arg Lys Ala 
225 230 235 240 

Arg Pro Gly Met Val Ser Thr Phe Glu Gin Tyr Gin Phe Leu Tyr Asp 
245 250 255 

Val lie Ala Ser 
260 



(2) INFORMATION FOR SEQ ID NO: 11: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asn Met Lys Lys Asn Arg Val Leu Gin He He Pro Tyr Glu Phe Asn 
* -5 10 15 

Arg Val He He Pro Val Lys Arg Gly Glu Glu Asn Thr Asp Tyr Val 
20 25 30 

Asn Ala Ser Phe He Asp Gly Tyr Arg Gin Lys Asp. Ser Tyr He Ala 
35 40 45 

Ser Gin Gly Pro Leu Leu His Thr He Glu Asp Phe Trp Arg Met He 
50 55 60 

Trp Glu Trp Lys Ser Cys Ser He Val Met Leu Thr Glu Leu Glu Glu 
65 70 75 80 

Arg Gly Gin Glu Lys Cys Ala Gin Tyr Trp Pro Ser Asp Gly Leu Val 
85 90 95 

Ser Tyr Gly Asp He Thr Val Glu Leu Lys Lys Glu Glu Glu Cys Glu 
100 105 110 

Ser Tyr Thr Val Arg Asp Leu Leu Val Thr Asn Thr Arg Glu Asn Lys 
115 120 125 

Ser Arg Gin He Arg Gin Phe His Phe His Gly Trp Pro Glu Val Gly 
130 135 140 

He Pro Ser Asp Gly Lys Gly Met He Ser He lie Ala Ala Val Gin 
145 150 155 160 

Lys Gin Gin Gin Gin Ser Gly Asn His Pro He Thr Val His Cys Ser 
165 170 175 

Ala Gly Ala Gly Arg Thr Gly Thr Phe Cys Ala Leu Ser Thr Val Leu 
180 185 190 

Glu Arg Val Lys Ala Glu Gly - He Leu Asp Val Phe Gin Thr Val Lys 
195 200 205 

Ser Leu Ala Leu Gin Arg Pro His Met Val Gin Thr Leu Glu Gin Tyr 
210 215 220 

Glu Phe Cys Tyr Lys Val Val Gin Glu 
225 230 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
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Asn Arg Glu Lys Asn Arg Thr Ser Ser lie lie Pro Val Glu Arg Ser 
1 5 10 15 

Arg Val Gly lie Ser Ser Leu Ser Gly Glu Gly Thr Asp Tyr lie Asn 
20 25 30 

Ala Ser Tyr lie Met Gly Tyr Tyr Gin Ser Asn Glu Phe lie lie Thr 
35 40 45 

Gin His Pro Leu Leu His Thr He Lye Asp Phe Trp Arg Met He Trp 
50 55 60 

Asp His Asn Ala Gin Leu Val Val Met lie Pro Asp Gly Gin Asn Met 
65 70 * 75 80 

Ala Glu Asp Glu Phe Val Tyr Trp Pro Asn Lys Asp Glu Pro lie Asn 
85 90 95 



Cys Glu Ser Phe Lys Val Thr Leu Met Ala Glu Glu His Lys Cys Leu 
100 105 110 

Ser Asn Glu Glu Lys Leu lie He Gin Asp Phe lie Leu Glu Ala Thr 
115 120 125 

Gin Asp Asp Tyr Val Leu Glu Val Arg His Phe Gin Cys Pro Lys Trp 
130 135 140 

Pro Asn Pro Asp Ser Pro He Ser Lys Thr Phe Glu Leu He Ser Val 
145 150 155 160 

He Lys Glu Glu Ala Ala Asn Arg Asp Gly Pro Met He Val His Asp 
165 170 175 

Glu His Gly Gly Val Thr Ala Gly Thr Phe Cys Ala Leu Thr Thr Leu 
180 185 190 

Met His Gin Leu Glu Lys Glu Asn Ser Val Asp Val Tyr Gin Val Ala 
195 200 205 

Lys Met He Asn Leu Met Arg Pro Gly Val Phe Ala Asp He Glu Gin 
210 215 220 

Tyr Gin Phe Leu Tyr Lys Val He Leu Ser 
225 230 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asn Lys Glu Lys Asn Arg Asn Ser Ser Val Val Pro Ser Glu Arg Ala 
1 5 10 15 

Arg Val Gly Leu Ala Pro Leu Pro Gly Met Lys Gly Thr Asp Tyr He 
20 25 30 

Asn Ala Ser Tyr He Met Gly Tyr Tyr Arg Ser Asn Glu Phe He He 
35 40 45 
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Thr Gin His Pro Leu Pro His Thr Thr Lys Asp Phe Trp Arg Met lie 
50 55~ 60 

Trp Asp His Asn Ala Gin lie lie Val Met Leu Pro Asp Asn Gin Ser 
65 70 75 80 

Leu Ala Glu Asp Glu Phe Val Tyr Trp Pro Ser Arg Glu Glu Ser Met 
85 90 95 

Asn Cys Glu Ala Phe Thr Val Thr Leu lie Ser Lys Asp Arg Leu Cys 

100 105 110 

* 

Leu Ser Asn Glu Glu Gin lie lie lie His Aej> Phe lie Leu Glu Ala 
115 120 125 

Thr Gin Asp Asp Tyr Val Leu Glu Val Arg His Phe Gin Cys Pro Lys 
130 135 140 

* Trp Pro Asn Pro Asp Ala Pro lie Ser Ser Thr Phe Glu Leu lie Asn 
145 150 155 160 

Val lie Lys Glu Glu Ala Leu Thr Arg Asp Gly Pro Thr lie Val His 
165 170 175 

Asp Glu Tyr Gly Ala Val Ser Ala Gly Met Leu Cys Ala Leu Thr Thr 
180 185 190 

Leu Ser Gin Gin Leu Glu Asn Glu Asn Ala Val Asp Val Phe Gin Val 
195 200 205 

Ala Lys Met lie Asn Leu Met Arg Pro Gly Val Phe Thr Asp lie Glu 
210 215 220 

Glh Tyr Gin Phe lie Tyr Lys Ala Arg Leu Ser 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-sites 

(B) LOCATION: 1-.280 

(D) OTHER INFORMATION: /label= Xaa 

/note= -For the Consensus Sequence, Xaa - Lack of 
Consensus 1 * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asn Lys Glu Lys Asn Arg Asn Ser Ser Xaa lie Pro Tyr Glu Arg Asn 
1 5 10 15 

• Arg Val Gly Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Glu Glu Gly Thr 
35 40 45 

Asp Tyr He Asn Ala Ser Xaa He Met Gly Tyr Tyr Gin Ser Asn Glu 
50 55 60 
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Phe lie Xaa Thr Gin Xaa Pro Leu Leu His Thr He Lys Asp Phe Trp 
65 70 75 80 

Arg Met He Trp Asp His Xaa Asn Ala Gin He Val Met Leu Xaa Xaa 
85 90 95 

Xaa Gin Xaa Xaa Ala Glu Xaa Glu Xaa Xaa Gin Tyr Trp Pro Ser Xaa 
10O 105 HO 

Gly Xaa Xaa Xaa Tyr Gly Asp Xaa Xaa Val Xaa Leu Lys Xaa Xaa Xaa 
115 120 125 

Asn Cys Glu Ser Xaa Thr Val Thr Xaa Xaa Xaa Glu Xaa Arg Xaa Cys 
130 * 135 140 

Leu Ser Asn Glu Xaa Arg Xaa He He Gin Asp Phe He Leu Glu Ala 
145 >Jl50 155 160 



Thr Gin Asp Asp Tyr Val Leu Glu Val Arg His Phe Gin Cys Pro Lys 
165 170 175 

Trp Pro Asn Pro Asp Xaa Pro He Ser Xaa Thr Xaa Glu Leu lie Ser 
180 185 190 

Val lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gin Lys Xaa Glu Glu Ala 
19S 200 205 

Xaa Asn Arg Xaa Xaa Xaa Asp Gly Pro Xaa He Val His Xaa Glu Xaa 
210 215 220 

Gly Ala Val Xaa Xaa Gly Thr Phe Cys Ala Leu Thr Thr Leu Leu Glu 
225 230 235 240 

Gin Leu Glu Xaa Glu Asn Xaa Val Asp Val Phe Gin Val Xaa Lys Met 
245 250 255 

Xaa Asn Leu Met Arg Pro Gly Xaa Xaa Xaa Xaa He Glu Gin Tyr Gin 
260 265 270 

Phe Leu Tyr Lys Val He Leu Ser 
275 280 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



U LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




